(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GTTTAAAAAA AACCACCAAG CGAGTTGGAC GCGTAACTCT TTGTAACGGA TCTCGGAACG 60
CCGTGGGAGT CGGAAAATCG CTGGACGCGT GTTCGTGCGT TTGCATGTGT GCGTGCGTTC 120
GTGTGTGTGT GTGTGCTAAT GTGCGAGCGG GTGAGCGAAT AAAAATAAAT ATATATCGTC 180
AAGTCAGGCT TAAGAAATGT GCGCTAATCA AAGAAAATGC CCCCAATTCT GGCCAATTGA 240
GAATTGTGGC TAAACAAAAA ATTCGACCGG AGTTCAAAAA TAAACAATCC AGTGAATAAA 300
CACACAAAAT CAATCAAAAA AGAAGATTTT TCTTTTTTAT TTTCGCTTTT AATTTATTAA 360
CGAGAATAAT AAATAAATAA ATAAATAAAT ATAAACAAAA ATAAAAATAT AAGAAAAGTG 420
TACGTGACAA GAGCTCGAAA AGAAGTTGCA ACAAATAGCA AAAATAATTC GTGCGTGCGA 480
AAAAGTGCTG CGAAGTTTTA TGGCCCATGC AAAAAGTGCT AAATTTGTAA ATGGCATGGA 540
AAGTGCAAAG CTCTGATTAA AAAACCCGCG AAGATTGGAG TGCGAGGTGC CGCCCAATAA 600
CGCAACCAAC TACTGCCACA AGGAAATTAT TAAGACCAAT CAACGACCAA AAAAATAAAA 660
AATAAAACAA AAGCAAGCAG AAATTTGGTG CTAGTTCTGT TTAGTCGACA GCCATCCACG 720
TTGGATCCCC ATCGCAAATA ATGTCATCAA AATGTGCTTT CAACATTGTA TTCGTATCGA 780
TCATTTTCAT CATCATCGTA AATGGTTACG CAAAAGATAT TTCTGGAGTT AAAAGAGGTC 840
ATGAACGACT TAACGAATAC ATATCCCACT ATGAAACACT CAACTATGAT CACGAGCACA 900
TCCGAGCTAG TCACAATAGA GCGCGACGAT CAGTGACCAA AGATCAATAT GTACATTTAA 960
AGTTTGCATC ACATGGAAGA GACTTCCATC TTAGATTAAA ACGTGATTTA AATACATTTA 1020
GCAATAAGTT AGACTTTTAT GATAGCAAAG GTCCCATTGA TGTCTCCACG GATCATATCT 1080
ATGAGGGCGA AGTGATAGGG GATCGTAATA GTTATGTATT TGGTTCCATA CACAATGGGG 1140
TATTCGAGGG TAAAATTATA ACGGAACGTG ATGCCTATTA TGTTGAACAT GCCAAACATT 1200
ATTTTCCCAC AAATCGCACG GCGACAACAA CACCACCATC GACTTCGACG ACATCCTCAG 1260
CAACAACAGT CACAAAAAGC ACACAACCAA CACGGCCTTT GGCCAAAAGC AACACCAGTA 1320
CTACTGCCGT TAATAGTAAG ACAGAAAACT TTATAAAGAA AATTGCTGAA TCCACAACGA 1380
CGAGCCAGCA GCTTCCAGAA TATACCGAAT CGTCGTCGTC GTCGTCGACA ACAACATTCC 1440
CACCCACAAC AGAGTATTTC GAGGACGAAA AGGAGCGTAA TGCCGAGGAC GAACTTGATT 1500
TTCACTCCAT TATCTACAAG GAGTCACATG TCGAGGACGC CTACGAAAAT GTGCGCGAAG 1560
GTCACGTGGC CGGCTGTGGC ATCACGGATG AGGTCTCTCA GTGGATGGAG AACATACAAA 1620
ATTCAGCCGT CGAAGAGTTG CCGGAGCCCA TGTCAAAGGA CTATCAAAAG CTCCACCGGA 1680
AGCAGCTGCA CAAAAAGTCC GCCCCACAGC AACAACAGCA GCCCCATCCG CCGAAGAAGT 1740
ACATCAGCGG GGATGAGGAC TTCAAGTATC CCCACCAGAA GTACACGAAG GAAGCTAACT 1800
TCGCCGAGGG TGCATTCTAC GATCCATCGA CCGGACGTCG CCTGGGCTCA TCCGCCAACG 1860
TGGCCGACTG GCATCAGCTC GTCCACGAGC GCGTCCGCCG CGCCACCGAC AATGGTGCTG 1920
GGGATAGGGG CTCATCCGGT GGATCTGGAC GCGGTCGCGA GGACAACAAG AATACCTGCT 1980
CGCTCTACAT TCAAACGGAT CCATTGATAT GGCGCCACAT ACGCGAAGGC ATTGCTGACC 2040
ACGATCGTGG ACGCAAGTAC GAGGTGGATG AGAAAACGCG CGAGGAAATC ACATCGTTGA 2100
TTGCACATCA CGTGACGGCC GTTAATTACA TTTACCGCAA CACAAAGTTC GACGGACGCA 2160
CCGAGCATCG CAACATACGC TTTGAGGTGC AACGCATTAA GATCGATGAC GATTCGGCCT 2220
GTCGCAATTC CTACAATGGT CCACACAATG CCTTTTGCAA TGAACACATG GATGTCTCGA 2280
ACTTTTTGAA TCTGCATTCC CTAGAAGATC ACTCGGACTT TTGTTTGGCT TACGTGTTCA 2340
CCTACAGAGA TTTCACTGGC GGCACTTTGG GTCTGGCCTG GGTGGCCAGT GCGTCGGGAG 2400
CCTCTGGTGG AATTTGCGAG AAGTACAAGA CGTACACGGA AACGGTGGGT GGACAGTACC 2460
AGAGCACCAA GCGATCACTC AACACGGGCA TCATCACCTT TGTCAACTAC AACAGTCGGG 2520
TGCCGCCGAA AGTGTCGCAG CTTACGTTGG CACACGAGAT TGGCCACAAC TTTGGATCAC 2580
CTCACGATTA CCCTCAGGAA TGTCGTCCTG GTGGCCTAAA TGGCAATTAC ATTATGTTCG 2640
CCAGTGCCAC CTCCGGTGAT AGGCCAAATA ACTCCAAGTT CTCGCCCTGC TCCATTCGGA 2700
ACATCTCCAA TGTCCTTGAC GTGCTGGTGG GCAACACGAA GCGCGACTGC TTCAAGGCCT 2760
CGGAAGGTGC CTTCTGCGGC AACAAGATCG TGGAGTCTGG CGAGGAATGC GACTGTGGCT 2820
TCAACGAGGA GGAGTGCAAG GACAAGTGCT GCTACCCGCG TCTGATCAGC GAGTACGACC 2880
AGTCGCTGAA CTCCAGTGCC AAGGGATGCA CGCGCCGCGC CAAGACCCAG TGCTCACCAT 2940
CGCAGGGTCC GTGCTGTCTG TCCAACTCCT GCACCTTTGT GCCGACGAGC TACCACCAGA 3000
AGTGCAAGGA GGAGACGGAG TGCAGCTGGT CGAGCACATG CAACGGAACC ACGGCCGAGT 3060
GTCCGGAGCC ACGTCATCGC GATGACAAGA CCATGTGCAA CAATGGAACA GCGCTATGCA 3120
TCCGCGGTGA ATGTAGTGGA TCGCCATGTT TGCTCTGGAA TATGACAAAG TGCTTCCTTA 3180
CCTCGACCAC ACTGCCGCAC GTGAGCAAGC GCAAGTTGTG CGACTTGGCC TGCCAGGATG 3240
GCAATGACAC CTCCACCTGC CGCAGCACCA GCGAGTTTGC CGATAAATAT AATATTCAAA 3300
AGGGTGGTAT TAGTCTGCAG CCCGGTTCGC CATGCGATAA TTTCCAGGGC TACTGCGATG 3360
TGTTCCTTAA GTGTCGAGCC GTGGATGCCG ATGGTCCGCT TCTTCGGCTG AAGAATTTGT 3420
TGCTCAACCG GAAGACCCTG CAAACGGTGG CCGAGTGGAT CGTCGACAAT TGGTACCTAG 3480
TGGTTCTGAT GGGAGTGGCC TTTATTGTGG TCATGGGTTC GTTCATCAAA TGTTGTGCCG 3540
TGCACACGCC CAGTTCCAAT CCGAAGAAGC GACGAGCTCG TCGAATCAGC GAAACTCTAA 3600
GAGCACCCAT GAACACGTTG CGTAGAATGC AACGTCATCC CAATCAGCGA GGAGCAGGTC 3660
CTCGAAGCAT CCCACCGCCG GCACATGAGG CGCAGCATTA TTCACGCGGC GGAGATGGTC 3720
GCGGCGGCGG CGGTGGAGGC GGAGGTCGCC ACGGTGGCTC TAGGTCACAC CATCAACAGC 3780
ATCCGCACGA TTGGGATCGT CATCAGGGTG GCCACTCAAT CGTCCCATTG CCCACCGGCG 3840
GCAGCCATTC AAGTCGCAAC TCGGCGGCGA ATCAAGCGAG AAGAAGCGAT GGACGAGGTC 3900
CACGATCCAC CAGCAGTGGG CGGCCGCAGG CTATAGCCAG CGGAAGCGGT GCCGCGAGCG 3960
GAGCAGCGCG ATCTCATGGC GGGTACGGAG CCGAACAGGC GATACCGGGT TCCATTGGTG 4020
GTGGTGTCCA GGCGGCCATT AGCAGCGGCG GTGTGGTGGC TCGGGCCCAG CTGCCGCTGC 4080
CATTGCCGCC GCCAAATGGA CAGCAGCAAA TGCAACAGCA ACAACAACTG CAACTACAGC 4140
AACCGGCAAT TTCGCCGCAG CAGCAGCCGC AGCAAGCGTT CTACACGCCG AAAGAACTAC 4200
CACCACGCAA TAAGTCCCGA TCATCACGTA CCAACAACAC CTCCAACACC ACAACCACCA 4260
CCAACTCATC CACAGCGGCA GCCGGCAGTG GGTCGGTCTC GGGACCGGGC TCGGGGGCGG 4320
GCAGTAGTAG TAAGAGCAAG AGCGGTAAAA GTGCCAAAGC CAAAGACTCA AAGTCGCAAA 4380
AATCGCAGCA GGCCAACAAC AGTCGCAGCA GCAGCAAGGA GAAGGGCGTC AAGCCAGTGC 4440
GCCGAAATAT CGTTTATTAG GAGCGGAACC ATCACATTGC CATACACAAC ACTGAACGAA 4500
ATATAGCCCC GAACCCAAAA TATCAAATGC AACCACATAT AGAATCGCCC GCTGCTAGTC 4560
ATCGAACTAC ATGTATGAGT TGTTGCTTCC CATCCACCGA CAAACACAAA CAGAAAAGAA 4620
ATTATAATGA TATTTCATTT AATCGATGCA ATTGGCGTCG CGCCGCCTCC GCTACAAGTA 4680
AGCTTTAGTC GGCCGACATC GTTGCACGAG CAACAGCAGC AGCAACATCA TCTGCAGCAG 4740
CAGCAGCAGC ATCAGCAGCA ACTGGAGCCG CAGCAGCAAC ACGCCTATGC CGATGCTTAT 4800
GCGGCCTTGG GGCGGGGCCA GTATGAGTCC ACCACGCGGG CGCCCAACAA CAGCAAGGTT 4860
TGACAGCCAA AAGTAGCAAT GGAGCGCCAC AAAAGGCCAA AGGCTAAGCG ACTCAAGCAG 4920
CAGAAGGAGC CGCATACACA GCAAACAACA ACACAGCAAC AAAAGCAAAA ACAACATAAA 4980
TCAAATGAAC TCAAATTAAA TGTAAATGTA ATTTTTATGC TAATTATTTT TATTTAAACA 5040
GTGTTTGTAT GCCACAAGGG AAAACAGCCA GCAACAAAAA GAAAAATACA AAAATAACAC 5100
AAAAAAGGAG ACAAATTTCG TAATACAGAA AAAGCTGAAA GTGAATGATA TTTTTGATTA 5160
ACTAAATTAA AATGAAAATA CGAATGCAAA TTATGAATAA TAAAAGTAAT TAAAAACGAC 5220
AACATGCATA ATACATATAA AGTTGCAAGT TGCATATATA TACATTTGTA TGTATATATT 5280
TATTATGGAT ACACAATTAT TAAATAGCAG CAGCCACAAC AAACAAGTAA TATACATGAA 5340
GAAAAACTAA GGTTTAATTG TATGAGAAAG CATTCTATAT GTCGGTGAGA TTTCTAAGCG 5400
CTAGGCCGAA ATACAAAATT AATTACACAC TTGAATAACA AAATGTGTTT TGTACAAAAA 5460
AAAAAAAATG AAATAAACAA AAACAGTGCG AATTAATTAA GCGTCATTAT AAAAAAAAGA 5520
ACGGAAACAA CAAAGCATTT AAATTGTATT TATCTGTACC GAAGCTAAAC GTTTATTTAA 5580
AGCCGTCAAA ATTGCATTTG TAAACTAGCA AAACAAAAAA AAAAAAAAAC 5630



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1239 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
Ala Thr Thr Thr Pro Pro Ser Thr Ser Thr Thr Ser Ser Ala Thr Thr
165 170 175

Val Thr Lys Ser Thr Gln Pro Thr Arg Pro Leu Ala Lys Ser Asn Thr
180 185 190

Ser Thr Thr Ala Val Asn Ser Lys Thr Glu Asn Phe Ile Lys Lys Ile
195 200 205

Ala Glu Ser Thr Thr Thr Ser Gln Gln Leu Pro Glu Tyr Thr Glu Ser
210 215 220

Ser Ser Ser Ser Ser Thr Thr Thr Phe Pro Pro Thr Thr Glu Tyr Phe
225 230 235 240

Glu Asp Glu Lys Glu Arg Asn Ala Glu Asp Glu Leu Asp Phe His Ser
245 250 255

Ile Ile Tyr Lys Glu Ser His Val Glu Asp Ala Tyr Glu Asn Val Arg
260 265 270

Glu Gly His Val Ala Gly Cys Gly Ile Thr Asp Glu Val Ser Gln Trp
275 280 285

Met Glu Asn Ile Gln Asn Ser Ala Val Glu Glu Leu Pro Glu Pro Met
290 295 300

Ser Lys Asp Tyr Gln Lys Leu His Arg Lys Gln Leu His Lys Lys Ser
305 310 315 320

Ala Pro Gln Gln Gln Gln Gln Pro His Pro Pro Lys Lys Tyr Ile Ser
325 330 335

Gly Asp Glu Asp Phe Lys Tyr Pro His Gln Lys Tyr Thr Lys Glu Ala
340 345 350

Asn Phe Ala Glu Gly Ala Phe Tyr Asp Pro Ser Thr Gly Arg Arg Leu
355 360 365

Gly Ser Ser Ala Asn Val Ala Asp Trp His Gln Leu Val His Glu Arg
370 375 380

Val Arg Arg Ala Thr Asp Asn Gly Ala Gly Asp Arg Gly Ser Ser Gly
385 390 395 400

Gly Ser Gly Arg Gly Arg Glu Asp Asn Lys Asn Thr Cys Ser Leu Tyr
405 410 415

Ile Gln Thr Asp Pro Leu Ile Trp Arg His Ile Arg Glu Gly Ile Ala
420 425 430

Asp His Asp Arg Gly Arg Lys Tyr Glu Val Asp Glu Lys Thr Arg Glu
435 440 445

Glu Ile Thr Ser Leu Ile Ala His His Val Thr Ala Val Asn Tyr Ile
450 455 460

Tyr Arg Asn Thr Lys Phe Asp Gly Arg Thr Glu His Arg Asn Ile Arg
465 470 475 480

Phe Glu Val Gln Arg Ile Lys Ile Asp Asp Asp Ser Ala Cys Arg Asn
485 490 495

Ser Tyr Asn Gly Pro His Asn Ala Phe Cys Asn Glu His Met Asp Val
500 505 510

Ser Asn Phe Leu Asn Leu His Ser Leu Glu Asp His Ser Asp Phe Cys
515 520 525

Leu Ala Tyr Val Phe Thr Tyr Arg Asp Phe Thr Gly Gly Thr Leu Gly
530 535 540

Leu Ala Trp Val Ala Ser Ala Ser Gly Ala Ser Gly Gly Ile Cys Glu
545 550 555 560

Lys Tyr Lys Thr Tyr Thr Glu Thr Val Gly Gly Gln Tyr Gln Ser Thr
565 570 575

Lys Arg Ser Leu Asn Thr Gly Ile Ile Thr Phe Val Asn Tyr Asn Ser
580 585 590

Arg Val Pro Pro Lys Val Ser Gln Leu Thr Leu Ala His Glu Ile Gly
595 600 605

His Asn Phe Gly Ser Pro His Asp Tyr Pro Gln Glu Cys Arg Pro Gly
610 615 620

Gly Leu Asn Gly Asn Tyr Ile Met Phe Ala Ser Ala Thr Ser Gly Asp
625 630 635 640

Arg Pro Asn Asn Ser Lys Phe Ser Pro Cys Ser Ile Arg Asn Ile Ser
645 650 655

Asn Val Leu Asp Val Leu Val Gly Asn Thr Lys Arg Asp Cys Phe Lys
660 665 670

Ala Ser Glu Gly Ala Phe Cys Gly Asn Lys Ile Val Glu Ser Gly Glu
675 680 685

Glu Cys Asp Cys Gly Phe Asn Glu Glu Glu Cys Lys Asp Lys Cys Cys
690 695 700

Tyr Pro Arg Leu Ile Ser Glu Tyr Asp Gln Ser Leu Asn Ser Ser Ala
705 710 715 720

Lys Gly Cys Thr Arg Arg Ala Lys Thr Gln Cys Ser Pro Ser Gln Gly
725 730 735

Pro Cys Cys Leu Ser Asn Ser Cys Thr Phe Val Pro Thr Ser Tyr His
740 745 750

Gln Lys Cys Lys Glu Glu Thr Glu Cys Ser Trp Ser Ser Thr Cys Asn
755 760 765

Gly Thr Thr Ala Glu Cys Pro Glu Pro Arg His Arg Asp Asp Lys Thr
770 775 780

Met Cys Asn Asn Gly Thr Ala Leu Cys Ile Arg Gly Glu Cys Ser Gly
785 790 795 800

Ser Pro Cys Leu Leu Trp Asn Met Thr Lys Cys Phe Leu Thr Ser Thr
805 810 815

Thr Leu Pro His Val Ser Lys Arg Lys Leu Cys Asp Leu Ala Cys Gln
820 825 830

Asp Gly Asn Asp Thr Ser Thr Cys Arg Ser Thr Ser Glu Phe Ala Asp
835 840 845

Lys Tyr Asn Ile Gln Lys Gly Gly Ile Ser Leu Gln Pro Gly Ser Pro
850 855 860

Cys Asp Asn Phe Gln Gly Tyr Cys Asp Val Phe Leu Lys Cys Arg Ala
865 870 875 880

Val Asp Ala Asp Gly Pro Leu Leu Arg Leu Lys Asn Leu Leu Leu Asn
885 890 895

Arg Lys Thr Leu Gln Thr Val Ala Glu Trp Ile Val Asp Asn Trp Tyr
900 905 910

Leu Val Val Leu Met Gly Val Ala Phe Ile Val Val Met Gly Ser Phe
915 920 925

Ile Lys Cys Cys Ala Val His Thr Pro Ser Ser Asn Pro Lys Lys Arg
930 935 940

Arg Ala Arg Arg Ile Ser Glu Thr Leu Arg Ala Pro Met Asn Thr Leu
945 950 955 960

Arg Arg Met Gln Arg His Pro Asn Gln Arg Gly Ala Gly Pro Arg Ser
965 970 975

Ile Pro Pro Pro Ala His Glu Ala Gln His Tyr Ser Arg Gly Gly Asp
980 985 990

Gly Arg Gly Gly Gly Gly Gly Gly Gly Gly Arg His Gly Gly Ser Arg
995 1000 1005

Ser His His Gln Gln His Pro His Asp Trp Asp Arg His Gln Gly Gly
1010 1015 1020

His Ser Ile Val Pro Leu Pro Thr Gly Gly Ser His Ser Ser Arg Asn
1025 1030 1035 1040
Ser Ala Ala Asn Gln Ala Arg Arg Ser Asp Gly Arg Gly Pro Arg Ser
1045 1050 1055

Thr Ser Ser Gly Arg Pro Gln Ala Ile Ala Ser Gly Ser Gly Ala Ala
1060 1065 1070

Ser Gly Ala Ala Arg Ser His Gly Gly Tyr Gly Ala Glu Gln Ala Ile
1075 1080 1085

Pro Gly Ser Ile Gly Gly Gly Val Gln Ala Ala Ile Ser Ser Gly Gly
1090 1095 1100

Val Val Ala Arg Ala Gln Leu Pro Leu Pro Leu Pro Pro Pro Asn Gly
1105 1110 1115 1120

Gln Gln Gln Met Gln Gln Gln Gln Gln Leu Gln Leu Gln Gln Pro Ala
1125 1130 1135

Ile Ser Pro Gln Gln Gln Pro Gln Gln Ala Phe Tyr Thr Pro Lys Glu
1140 1145 1150

Leu Pro Pro Arg Asn Lys Ser Arg Ser Ser Arg Thr Asn Asn Thr Ser
1155 1160 1165

Asn Thr Thr Thr Thr Thr Asn Ser Ser Thr Ala Ala Ala Gly Ser Gly
1170 1175 1180

Ser Val Ser Gly Pro Gly Ser Gly Ala Gly Ser Ser Ser Lys Ser Lys
1185 1190 1195 1200

Ser Gly Lys Ser Ala Lys Ala Lys Asp Ser Lys Ser Gln Lys Ser Gln
1205 1210 1215

Gln Ala Asn Asn Ser Arg Ser Ser Ser Lys Glu Lys Gly Val Lys Pro
1220 1225 1230

Val Arg Arg Asn Ile Val Tyr
1235

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2796 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GAATTCCGGG TTTTGGAGGA GCTAGGAGCG TTGCCGGCCC CTGAAGTGGA GCGAGAGGGA 60
GGTGCTTTCG CCGTTCTCCT GCCAGGGGAG GTCCCGGCTT CCCGTGGAGG CTCCGGACCA 120
AGCCCCTTCA GCTTCTCCCT CCGGATCGAT GTGCTGCTGT TAACCCGTGA GGAGGCGGCG 180
GCGGCGGCAG CGGCAGCGGA AGATGGTGTT GCTGAGAGTG TTAATTCTGC TCCTCTCCTG 240
GGCGGCGGGG ATGGGAGGTC AGTATGGGAA TCCTTTAAAT AAATATATCA GACATTATGA 300
AGGATTATCT TACAATGTGG ATTCATTACA CCAAAAACAC CAGCGTGCCA AAAGAGCAGT 360
CTCACATGAA GACCAATTTT TACGTCTAGA TTTCCATGCC CATGGAAGAC ATTTCAACCT 420
ACGAATGAAG AGGGACACTT CCCTTTTCAG TGATGAATTT AAAGTAGAAA CATCAAATAA 480
AGTACTTGAT TATGATACCT CTCATATTTA CACTGGACAT ATTTATGGTG AAGAAGGAAG 540
TTTAGCCATG GGTCTGTTAT TGATGGAAGA TTTGAAGGAT TCATCCAGAC TCGTGGTGGC 600
ACATTTTATG TTTGAGCCAG CAGAGAGATA TATTAAAGAC CGAACTCTGC CATTTCACTC 660
TGTCATTTAT CATGAAGATG ATATTAACTA TCCCCATAAA TACGGTCCTC AGGGGGGCTG 720
TGCAGATCAT TCAGTATTTG AAAGAATGAG GAAATACCAG ATGACTGGTG TAGAGGAAGT 780
AACACAGATA CCTCAAGAAG AACATGCTGC TAATGGTCCA GAACTTCTGA GGAAAAAACG 840
TACAAATTCA GCTGAAAAAA ATACTTGTCA GCTTTATATT CAGACTGATC ATTTGTTCTT 900
TAAATATTAC GGAACACGAG AAGCTGTGAT TGCCCAGATA TCCAGTCATG TTAAAGCGAT 960
TGATACAATT TACCAGACCA CAGACTTCTC CGGAATCCGT AACATCAGTT TCATGGTGAA 1020
ACGCATAAGA ATCAATACAA CTGCTGATGA GAAGGACCCT ACAAATCCTT TCCGTTTCCC 1080
AAATATTGGT GTGGAGAAGT TTCTGGAATT GAATTCTGAG CAGAATCATG ATGACTACTG 1140
TTTGGCCTAT GTCTTCACAG ACCGAGATTT TGATGATGGC GTACTTGGTC TGGCTTGGGT 1200
TGGAGCACCT TCAGGAAGCT CTGGAGGAAT ATGTGAAAAA AGTAAACTCT ATTCAGATGG 1260
TAAGAAGAAG TCCTTAAACA CTGGAATTAT TACTGTTCAG AACTATGGGT CTCATGTACC 1320
TCCCAAAGTC TCTCACATTA CTTTTGCTCA CGAAGTTGGA CATAACTTTG GATCCCCACA 1380
TGATTCTGGA ACAGAGTGCA CACCAGGAGA ATCTAAGAAT TTGGGTCAAA AAGAAAATGG 1440
CAATTACATC ATGTATGCAA GAGCAACATC TGGGGACAAA CTTAACAACA ATAAATTCTC 1500
ACTCTGTAGT ATTAGAAATA TAAGCCAAGT TCTTGAGAAG AAGAGAAACA ACTGTTTTGT 1560
TGAATCTGGC CAACCTATTT GTGGAAATGG AATGGTAGAA CAAGGTGAAG AATGTGATTG 1620
TGGCTATAGT GACCAGTGTA AAGATGAATG CTGCTTCGAT GCAAATCAAC CAGAGGGAAG 1680
AAAATGCAAA CTGAAACCTG GGAAACAGTG CAGTCCAAGT CAAGGTCCTT GTTGTACAGC 1740
ACAGTGTGCA TTCAAGTCAA AGTCTGAGAA GTGTCGGGAT GATTCAGACT GTGCAAGGGA 1800
AGGAATATGT AATGGCTTCA CAGCTCTCTG CCCAGCATCT GACCCTAAAC CAAACTTCAC 1860
AGACTGTAAT AGGCATACAC AAGTGTGCAT TAATGGGCAA TGTGCAGGTT CTATCTGTGA 1920
GAAATATGGC TTAGAGGAGT GTACGTGTGC CAGTTCTGAT GGCAAAGATG ATAAAGAATT 1980
ATGCCATGTA TGCTGTATGA AGAAAATGGA CCCATCAACT TGTGCCAGTA CAGGGTCTGT 2040
GCAGTGGAGT AGGCACTTCA GTGGTCGAAC CATCACCCTG CAACCTGGAT CCCCTTGCAA 2100
CGATTTTAGA GGTTACTGTG ATGTTTTCAT GCGGTGCAGA TTAGTAGATG CTGATGGTCC 2160
TCTAGCTAGG CTTAAAAAAG CAATTTTTAG TCCAGAGCTC TATGAAAACA TTGCTGAATG 2220
GATTGTGGCT CATTGGTGGG CAGTATTACT TATGGGAATT GCTCTGATCA TGCTAATGGC 2280
TGGATTTATT AAGATATGCA GTGTTCATAC TCCAAGTAGT AATCCAAAGT TGCCTCCTCC 2340
TAAACCACTT CCAGGCACTT TAAAGAGGAG GAGACCTCCA CAGCCCATTC AGCAACCCCA 2400
GCGTCAGCGG CCCCGAGAGA GTTATCAAAT GGGACACATG AGACGCTAAC TGCAGCTTTT 2460
GCCTTGGTTC TTCCTAGTGC CTACAATGGG AAAACTTCAC TCCAAAGAGA AACCTATTAA 2520
GTCATCATCT CCAAACTAAA CCCTCACAAG TAACAGTTGA AGAAAAAATG GCAAGAGATC 2580
ATATCCTCAG ACCAGGTGGA ATTACTTAAA TTTTAAAGCC TGAAAATTCC AATTTGGGGG 2640
TGGGAGGTGG AAAAGGAACC CAATTTTCTT ATGAACAGAT ATTTTTAACT TAATGGCACA 2700
AAGTCTTAGA ATATTATTAT GTGCCCCGTG TTCCCTGTTC TTCGTTGCTG CATTTTCTTC 2760
ACTTGCAGGC AAACTTGGCT CTCAATAAAC TTTTCG 2796



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 748 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Val Leu Leu Arg Val Leu Ile Leu Leu Leu Ser Trp Ala Ala Gly
1 5 10 15

Met Gly Gly Gln Tyr Gly Asn Pro Leu Asn Lys Tyr Ile Arg His Tyr
20 25 30

Glu Gly Leu Ser Tyr Asn Val Asp Ser Leu His Gln Lys His Gln Arg
35 40 45

Ala Lys Arg Ala Val Ser His Glu Asp Gln Phe Leu Arg Leu Asp Phe
50 55 60

His Ala His Gly Arg His Phe Asn Leu Arg Met Lys Arg Asp Thr Ser
65 70 75 80

Leu Phe Ser Asp Glu Phe Lys Val Glu Thr Ser Asn Lys Val Leu Asp
85 90 95

Tyr Asp Thr Ser His Ile Tyr Thr Gly His Ile Tyr Gly Glu Glu Gly
100 105 110

Ser Leu Ala Met Gly Leu Leu Leu Met Glu Asp Leu Lys Asp Ser Ser
115 120 125

Arg Leu Val Val Ala His Phe Met Phe Glu Pro Ala Glu Arg Tyr Ile
130 135 140

Lys Asp Arg Thr Leu Pro Phe His Ser Val Ile Tyr His Glu Asp Asp
145 150 155 160

Ile Asn Tyr Pro His Lys Tyr Gly Pro Gln Gly Gly Cys Ala Asp His
165 170 175

Ser Val Phe Glu Arg Met Arg Lys Tyr Gln Met Thr Gly Val Glu Glu
180 185 190

Val Thr Gln Ile Pro Gln Glu Glu His Ala Ala Asn Gly Pro Glu Leu
195 200 205

Leu Arg Lys Lys Arg Thr Asn Ser Ala Glu Lys Asn Thr Cys Gln Leu
210 215 220

Tyr Ile Gln Thr Asp His Leu Phe Phe Lys Tyr Tyr Gly Thr Arg Glu
225 230 235 240

Ala Val Ile Ala Gln Ile Ser Ser His Val Lys Ala Ile Asp Thr Ile
245 250 255

Tyr Gln Thr Thr Asp Phe Ser Gly Ile Arg Asn Ile Ser Phe Met Val
260 265 270

Lys Arg Ile Arg Ile Asn Thr Thr Ala Asp Glu Lys Asp Pro Thr Asn
275 280 285

Pro Phe Arg Phe Pro Asn Ile Gly Val Glu Lys Phe Leu Glu Leu Asn
290 295 300

Ser Glu Gln Asn His Asp Asp Tyr Cys Leu Ala Tyr Val Phe Thr Asp
305 310 315 320

Arg Asp Phe Asp Asp Gly Val Leu Gly Leu Ala Trp Val Gly Ala Pro
325 330 335

Ser Gly Ser Ser Gly Gly Ile Cys Glu Lys Ser Lys Leu Tyr Ser Asp
340 345 350

Gly Lys Lys Lys Ser Leu Asn Thr Gly Ile Ile Thr Val Gln Asn Tyr
355 360 365

Gly Ser His Val Pro Pro Lys Val Ser His Ile Thr Phe Ala His Glu
370 375 380

Val Gly His Asn Phe Gly Ser Pro His Asp Ser Gly Thr Glu Cys Thr
385 390 395 400

Pro Gly Glu Ser Lys Asn Leu Gly Gln Lys Glu Asn Gly Asn Tyr Ile
405 410 415

Met Tyr Ala Arg Ala Thr Ser Gly Asp Lys Leu Asn Asn Asn Lys Phe
420 425 430

Ser Leu Cys Ser Ile Arg Asn Ile Ser Gln Val Leu Glu Lys Lys Arg
435 440 445

Asn Asn Cys Phe Val Glu Ser Gly Gln Pro Ile Cys Gly Asn Gly Met
450 455 460

Val Glu Gln Gly Glu Glu Cys Asp Cys Gly Tyr Ser Asp Gln Cys Lys
465 470 475 480

Asp Glu Cys Cys Phe Asp Ala Asn Gln Pro Glu Gly Arg Lys Cys Lys
485 490 495

Leu Lys Pro Gly Lys Gln Cys Ser Pro Ser Gln Gly Pro Cys Cys Thr
500 505 510

Ala Gln Cys Ala Phe Lys Ser Lys Ser Glu Lys Cys Arg Asp Asp Ser
515 520 525

Asp Cys Ala Arg Glu Gly Ile Cys Asn Gly Phe Thr Ala Leu Cys Pro
530 535 540

Ala Ser Asp Pro Lys Pro Asn Phe Thr Asp Cys Asn Arg His Thr Gln
545 550 555 560

Val Cys Ile Asn Gly Gln Cys Ala Gly Ser Ile Cys Glu Lys Tyr Gly
565 570 575

Leu Glu Glu Cys Thr Cys Ala Ser Ser Asp Gly Lys Asp Asp Lys Glu
580 585 590
Leu Cys His Val Cys Cys Met Lys Lys Met Asp Pro Ser Thr Cys Ala
595 600 605

Ser Thr Gly Ser Val Gln Trp Ser Arg His Phe Ser Gly Arg Thr Ile
610 615 620

Thr Leu Gln Pro Gly Ser Pro Cys Asn Asp Phe Arg Gly Tyr Cys Asp
625 630 635 640

Val Phe Met Arg Cys Arg Leu Val Asp Ala Asp Gly Pro Leu Ala Arg
645 650 655

Leu Lys Lys Ala Ile Phe Ser Pro Glu Leu Tyr Glu Asn Ile Ala Glu
660 665 670

Trp Ile Val Ala His Trp Trp Ala Val Leu Leu Met Gly Ile Ala Leu
675 680 685

Ile Met Leu Met Ala Gly Phe Ile Lys Ile Cys Ser Val His Thr Pro



Lys Arg Arg Arg Pro Pro Gln Pro Ile Gln Gln Pro Gln Arg Gln Arg
725 730 735

Pro Arg Glu Ser Tyr Gln Met Gly His Met Arg Arg
740 745

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2098 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAATTCTGAG CAGAATCATG ATGACTACTG TTTGGCCTAT GTCTTCACAG ACCGAGATTT 60
TGATGATGGC GTACTTGGTC TGGCTTGGGT TGGAGCACCT TCAGGAAGCT CTGGAGGAAT 120
ATGTGAAAAA AGTAAACTCT ATTCAGATGG TAAGAAGAAG TCCTTAAACA CTGGAATTAT 180
TACTGTTCAG AACTATGGGT CTCATGTACC TCCCAAAGTC TCTCACATTA CTTTTGCTCA 240
CGAAGTTGGA CATAACTTTG GATCCCCACA TGATTCTGGA ACAGAGTGCA CACCAGGAGA 300
ATCTAAGAAT TTGGGTCAAA AAGAAAATGG CAATTACATC ATGTATGCAA GAGCAACATC 360
TGGGGACAAA CTTAACAACA ATAAATTCTC ACTCTGTAGT ATTAGAAATA TAAGCCAAGT 420
TCTTGAGAAG AAGAGAAACA ACTGTTTTGT TGAATCTGGC CAACCTATTT GTGGAAATGG 480
AATGGTAGAA CAAGGTGAAG AATGTGATTG TGGCTATAGT GACCAGTGTA AAGATGAATG 540
CTGCTTCGAT GCAAATCAAC CAGAGGGAAG AAAATGCAAA CTGAAACCTG GGAAACAGTG 600
CAGTCCAAGT CAAGGTCCTT GTTGTACAGC ACAGTGTGCA TTCAAGTCAA AGTCTGAGAA 660
GTGTCGGGAT GATTCAGACT GTGCAAGGGA AGGAATATGT AATGGCTTCA CAGCTCTCTG 720
CCCAGCATCT GACCCTAAAC CAAACTTCAC AGACTGTAAT AGGCATACAC AAGTGTGCAT 780
TAATGGGGTA AGCATTTAAC TATATGTTTT AAAATTTAAT TTTAGAAAAC TTGTTTTTCA 840
GAAGAATTAT TGATGCTTAA AGCTACATAG TTAAAGTAAT TAATCTTGGT CTCTGTTTAA 900
GTAATATTCC CTCACAAAAC CATGAATATA TTATGTGGCA TTCAATTAGC TACTAATTTG 960
TCTTTCATCT TTCCATGTAC ATGTGGTTGA TATTCTCTAG AGAAACATAG TTGTACAACT 1020
CGGCATGTGA TTTGTCTATA ATATTTAAGT TTTATAAAAT AATATTTGAG TAGCCTAAAT 1080
AAAAGAACTC TTTGGTCATC TTCTCTGAAT ATCAAACCTT CAAAGCTTTT GTGGCTGAAT 1140
ATCACTTTGC TCTACAGGAA AAAAATTTAA TTTTTCTTTC TTTATAGAAG AGCCGTAATA 1200
ACCAACATAA AATCGATCCT CATCTAATCT CTTGCTCTGC TTTTATTTCA TTTTTTTAAG 1260
TTGCCATTGC TTTAAAAGAT TTACTATCTT TCTTGGATTT ACTGTTTTTC AAATTTTTTC 1320
AAATGTATTT ATGTAATTCA GTTTTGATAC TCATCTCTGT TTGTTTTTCA CTTTCATTTC 1380
CATTTAAATA TTTTGACATT GGAAGCTCAT ACTTGCCTGT CTGTTACTAT AAAAAATAGG 1440
TTTGACTGTA TAGGGATTAA ACAATTTGTC TTTTATTTTC TTCTAGCAAT GTGCAGGTTC 1500
TATCTGTGAG AAATATGGCT TAGAAGAGTG TACGTGTGCC AGTCTGATGG CAAAGATGAT 1560
AAAGAATTAT GCCATGTATG CTGTATGAAG AAAAGTAAGG CTTTTAAAAA CACAAGATAT 1620
AAAATTTGCC TCAAACTATT ATTTTCTCCT AAATTTTAAG TGTAAAACTT TGACCTACAG 1680
TTTGGCCAGA TAATTTCCAG CTAAATCTGT CCTCTTGAGG AGATTATAAA TGTAACGTAG 1740
CATTGTGTCT CTATTATTAT GGTCTCTACA ATGTTTTAAA AATGATAAAC TAGACAAAAC 1800
GTTGCCAGCT TTACAGCAGT AATTTACATA AACACTGTTA GACTTTAAGT CATCGTGGAC 1860
ACTGAGTCAA GACTTGCTGG TTGCTTGTTT ACATTGTAAC ATTTAATATG AATTACTGAT 1920
GGCGTTACCC AGCCTAACTA GAGAAGGTCT GTATAACATG TTATGGTAAT GATTTCAGTT 1980
TTTTTTCCCT CTTTGTATTT GCACAACTGG GAAATCTGAT CTGCAACTTA TATTTGAATC 2040
TGACCTTCAG CTTATATTTG GCATTTCTTT TCCAGTGGAC CCATCAACTC CGGAATTC 2098



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 265 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
Asn Ser Glu Gln Asn His Asp Asp Tyr Cys Leu Ala Tyr Val Phe Thr
1 5 10 15

Asp Arg Asp Phe Asp Asp Gly Val Leu Gly Leu Ala Trp Val Gly Ala
20 25 30

Pro Ser Gly Ser Ser Gly Gly Ile Cys Glu Lys Ser Lys Leu Tyr Ser
35 40 45

Asp Gly Lys Lys Lys Ser Leu Asn Thr Gly Ile Ile Thr Val Gln Asn
50 55 60

Tyr Gly Ser His Val Pro Pro Lys Val Ser His Ile Thr Phe Ala His
65 70 75 80

Glu Val Gly His Asn Phe Gly Ser Pro His Asp Ser Gly Thr Glu Cys
85 90 95

Thr Pro Gly Glu Ser Lys Asn Leu Gly Gln Lys Glu Asn Gly Asn Tyr
100 105 110

Ile Met Tyr Ala Arg Ala Thr Ser Gly Asp Lys Leu Asn Asn Asn Lys
115 120 125

Phe Ser Leu Cys Ser Ile Arg Asn Ile Ser Gln Val Leu Glu Lys Lys
130 135 140

Arg Asn Asn Cys Phe Val Glu Ser Gly Gln Pro Ile Cys Gly Asn Gly
145 150 155 160

Met Val Glu Gln Gly Glu Glu Cys Asp Cys Gly Tyr Ser Asp Gln Cys
165 170 175

Lys Asp Glu Cys Cys Phe Asp Ala Asn Gln Pro Glu Gly Arg Lys Cys
180 185 190

Lys Leu Lys Pro Gly Lys Gln Cys Ser Pro Ser Gln Gly Pro Cys Cys
195 200 205

Thr Ala Gln Cys Ala Phe Lys Ser Lys Ser Glu Lys Cys Arg Asp Asp
210 215 220

Ser Asp Cys Ala Arg Glu Gly Ile Cys Asn Gly Phe Thr Ala Leu Cys
225 230 235 240

Pro Ala Ser Asp Pro Lys Pro Asn Phe Thr Asp Cys Asn Arg His Thr
245 250 255

Gln Val Cys Ile Asn Gly Val Ser Ile
260 265



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2481 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CCGTGAGGAG GCGGCGGCCG GGAAGATGGT GTTGCCGACA GTGTTAATTC TGCTCCTCTC 60
CTGGGCGGCG GGGCTGGGAG GTCAGTATGG AAATCCTTTA AATAAATATA TTAGACATTA 120
TGAAGGATTA TCTTACAATG TGGATTCATT ACACCAAAAA CACCAGCGTG CCAAACGAGC 180
AGTCTCACAT GAGGACCAGT TTTTACTTCT AGATTTCCAT GCTCATGGAA GACAGTTCAA 240
CCTACGAATG AAGAGGGACA CTTCCCTTTT TAGTGATGAA TTTAAAGTAG AAACATCAAA 300
TAAAGTACTT GATTATGATA CCTCTCATAT TTACACTGGA CATATTTATG GTGAAGAAGG 360
AAGCTTTAGT CATGGGTCTG TCATTGATGG AAGATTTGAA GGTTTCATCA AGACTCGTGG 420
TGGCACGTTT TACATTGAGC CAGCAGAGAG ATACATTAAA GATCGAATCC TGCCATTTCA 480
CTCTGTCATT TATCATGAAG ATGATATTAA CTATCCCCAT AAATACGGCC CACAGGGGGG 540
CTGTGCAGAT CACTCCGTTT TTGAAAGGAT GAGGAAGTAC CAAATGACTG GAGTAGAGGA 600
AGGAGCCCGG GCACATCCAG AGAAGCATGC TGCTAGTAGT GGTCCTGAGC TCCTGAGGAA 660
AAAACGCACA ACTCTGGCTG AAAGAAATAC TTGTCAGCTC TATATCCAGA CAGATCACCT 720
GTTCTTTAAA TACTATGGAA CACGAGAAGC TGTGATTGCT CAGATATCCA GTCATGTTAA 780
AGCAATTGAT ACAATTTACC AGACTACAGA CTTCTCCGGA ATCCGTAACA TCAGCTTCAT 840
GGTGAAACGC ATAAGAATCA ATACAACCTC TGATGAAAAA GACCCTACAA ATCCTTTCCG 900
TTTCCCAAAT ATTGGTGTGG AGAAGTTCCT GGAGTTGAAT TCTGAGCAGA ATCATGATGA 960
CTACTGCCTG GCCTATGTCT TCACAGACCG GGATTTTGAT GATGGTGTTC TTGGTCTGGC 1020
CTGGGTTGGA GCACCTTCAG GAAGCTCTGG GGGAATATGT GAGAAAAGCA AGTTGTATTC 1080
AGATGGCAAG AAGAAGTCAT TGAACACAGG CATCATTACT GTTCAGAACT ATGGCTCCCA 1140
TGTGCCTCCC AAAGTCTCTC ATATTACGTT TGCTCATGAA GTTGGACATA ACTTTGGATC 1200
TCCACATGAT TCTGGAACAG AGTGTACTCC AGGAGAGTCT AAGAACTTAG GACAAAAAGA 1260
AAATGGCAAT TACATCATGT ATGCAAGAGC AACATCTGGG GACAAACTTA ACAACAACAA 1320
ATTTTCACTC TGCAGCATTA GAAACATAAG CCAAGTGCTT GAGAAGAAGA GGAACAACTG 1380
TTTTGTTGAA TCTGGCCAGC CTATCTGTGG AAACGGGATG GTGGAACAAG GGGAAGAGTG 1440
TGACTGTGGC TACAGTGACC AGTGCAAAGA TGATTGCTGC TTCGATGCCA ACCAGCCAGA 1500
GGGGAAGAAA TGCAAGCTGA AGCCTGGGAA GCAGTGCAGT CCGAGTCAAG GACCCTGCTG 1560
TACAGCACAG TGTGCATTCA AGTCAAAGTC TGAAAAGTGC CGGGATGATT CTGACTGTGC 1620
AAAGGAAGGG ATATGCAATG GCTTCACAGC CCTTTGCCCA GCATCTGATC CCAAGCCCAA 1680
CTTTACAGAC TGTAACAGGC ACACACAAGT GTGCATTAAT GGGCAATGTG CAGGTTCTAT 1740
TTGTGAAAAG TATGACTTGG AGGAGTGCAC CTGTGCCAGC TCTGATGGCA AAGATAATAA 1800
GGAATTATGC CATGTTTGCT GCATGAAGAA AATGGCTCCA TCAACTTGTG CCAGTACAGG 1860
CTCTTTGCAG TGGAGCAAGC AGTTCAGTGG TCGGACTATC ACTCTGCAGC CGGGCTCTCC 1920
ATGTAATGAC TTCAGAGGCT ACTGTGATGT TTTCATGCGG TGCAGATTAG TAGATGCTGA 1980
TGGCCCTCTA GCTAGGCTGA AAAAAGCCAT TTTTAGTCCA CAACTCTATG AAAACATTGC 2040
TGAGTGGATT GTGGCTCACT GGTGGGCAGT ACTGCTTATG GGAATTGCCC TGATCATGTT 2100
AATGGCTGGA TTTATCAAGA TTTGCAGTGT TCACACTCCA AGTAGTAATC CAAAGTTGCC 2160
GCCTCCTAAA CCACTTCCAG GCACTTTAAA GAGGAGGAGA CCGCCACAGC CCATTCAGCA 2220
GCCCCCGCGT CAGAGGCCCC GAGAGAGTTA TCAAATGGGA CACATGCGAC GCTAATGCAG 2280
CTTTTGCCTT GGTTCTTCCT AGTGCCTACA GTGGGAAAAC TTCACTCCAA AGAGAAACCT 2340
GTTAAGTCAT CATCTGCAAA TGATACCCTT ACAGTTAATA GTTGAAGAAA AAATGGCAAG 2400
AGATCATGTC CTCAGATCAG GTGGAATTAC TCAAAATTTA AAGCCTGAAA ATTCCAATTT 2460
TGGGGGTGGG GGTGGGATGG G 2481

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 749 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Val Leu Pro Thr Val Leu Ile Leu Leu Leu Ser Trp Ala Ala Gly
1 5 10 15

Leu Gly Gly Gln Tyr Gly Asn Pro Leu Asn Lys Tyr Ile Arg His Tyr
20 25 30

Glu Gly Leu Ser Tyr Asn Val Asp Ser Leu His Gln Lys His Gln Arg
35 40 45

Ala Lys Arg Ala Val Ser His Glu Asp Gln Phe Leu Leu Leu Asp Phe
50 55 60

His Ala His Gly Arg Gln Phe Asn Leu Arg Met Lys Arg Asp Thr Ser
65 70 75 80

Leu Phe Ser Asp Glu Phe Lys Val Glu Thr Ser Asn Lys Val Leu Asp
85 90 95

Tyr Asp Thr Ser His Ile Tyr Thr Gly His Ile Tyr Gly Glu Glu Gly
100 105 110

Ser Phe Ser His Gly Ser Val Ile Asp Gly Arg Phe Glu Gly Phe Ile
115 120 125

Lys Thr Arg Gly Gly Thr Phe Tyr Ile Glu Pro Ala Glu Arg Tyr Ile
130 135 140

Lys Asp Arg Ile Leu Pro Phe His Ser Val Ile Tyr His Glu Asp Asp
145 150 155 160

Ile Asn Tyr Pro His Lys Tyr Gly Pro Gln Gly Gly Cys Ala Asp His
165 170 175

Ser Val Phe Glu Arg Met Arg Lys Tyr Gln Met Thr Gly Val Glu Glu
180 185 190

Gly Ala Arg Ala His Pro Glu Lys His Ala Ala Ser Ser Gly Pro Glu
195 200 205

Leu Leu Arg Lys Lys Arg Thr Thr Leu Ala Glu Arg Asn Thr Cys Gln
210 215 220

Leu Tyr Ile Gln Thr Asp His Leu Phe Phe Lys Tyr Tyr Gly Thr Arg
225 230 235 240

Glu Ala Val Ile Ala Gln Ile Ser Ser His Val Lys Ala Ile Asp Thr
245 250 255

Ile Tyr Gln Thr Thr Asp Phe Ser Gly Ile Arg Asn Ile Ser Phe Met
260 265 270

Val Lys Arg Ile Arg Ile Asn Thr Thr Ser Asp Glu Lys Asp Pro Thr
275 280 285

Asn Pro Phe Arg Phe Pro Asn Ile Gly Val Glu Lys Phe Leu Glu Leu
290 295 300

Asn Ser Glu Gln Asn His Asp Asp Tyr Cys Leu Ala Tyr Val Phe Thr
305 310 315 320

Asp Arg Asp Phe Asp Asp Gly Val Leu Gly Leu Ala Trp Val Gly Ala
325 330 335

Pro Ser Gly Ser Ser Gly Gly Ile Cys Glu Lys Ser Lys Leu Tyr Ser
340 345 350

Asp Gly Lys Lys Lys Ser Leu Asn Thr Gly Ile Ile Thr Val Gln Asn
355 360 365

Tyr Gly Ser His Val Pro Pro Lys Val Ser His Ile Thr Phe Ala His
370 375 380

Glu Val Gly His Asn Phe Gly Ser Pro His Asp Ser Gly Thr Glu Cys
385 390 395 400

Thr Pro Gly Glu Ser Lys Asn Leu Gly Gln Lys Glu Asn Gly Asn Tyr
405 410 415

Ile Met Tyr Ala Arg Ala Thr Ser Gly Asp Lys Leu Asn Asn Asn Lys
420 425 430

Phe Ser Leu Cys Ser Ile Arg Asn Ile Ser Gln Val Leu Glu Lys Lys
435 440 445

Arg Asn Asn Cys Phe Val Glu Ser Gly Gln Pro Ile Cys Gly Asn Gly
450 455 460

Met Val Glu Gln Gly Glu Glu Cys Asp Cys Gly Tyr Ser Asp Gln Cys
465 470 475 480

Lys Asp Asp Cys Cys Phe Asp Ala Asn Gln Pro Glu Gly Lys Lys Cys
485 490 495

Lys Leu Lys Pro Gly Lys Gln Cys Ser Pro Ser Gln Gly Pro Cys Cys
500 505 510

Thr Ala Gln Cys Ala Phe Lys Ser Lys Ser Glu Lys Cys Arg Asp Asp
515 520 525

Ser Asp Cys Ala Lys Glu Gly Ile Cys Asn Gly Phe Thr Ala Leu Cys
530 535 540

Pro Ala Ser Asp Pro Lys Pro Asn Phe Thr Asp Cys Asn Arg His Thr
545 550 555 560

Gln Val Cys Ile Asn Gly Gln Cys Ala Gly Ser Ile Cys Glu Lys Tyr
565 570 575

Asp Leu Glu Glu Cys Thr Cys Ala Ser Ser Asp Gly Lys Asp Asn Lys
580 585 590

Glu Leu Cys His Val Cys Cys Met Lys Lys Met Ala Pro Ser Thr Cys
595 600 605

Ala Ser Thr Gly Ser Leu Gln Trp Ser Lys Gln Phe Ser Gly Arg Thr
610 615 620

Ile Thr Leu Gln Pro Gly Ser Pro Cys Asn Asp Phe Arg Gly Tyr Cys
625 630 635 640

Asp Val Phe Met Arg Cys Arg Leu Val Asp Ala Asp Gly Pro Leu Ala
645 650 655

Arg Leu Lys Lys Ala Ile Phe Ser Pro Gln Leu Tyr Glu Asn Ile Ala
660 665 670

Glu Trp Ile Val Ala His Trp Trp Ala Val Leu Leu Met Gly Ile Ala
675 680 685

Leu Ile Met Leu Met Ala Gly Phe Ile Lys Ile Cys Ser Val His Thr
690 695 700

Pro Ser Ser Asn Pro Lys Leu Pro Pro Pro Lys Pro Leu Pro Gly Thr
705 710 715 720

Leu Lys Arg Arg Arg Pro Pro Gln Pro Ile Gln Gln Pro Pro Arg Gln
725 730 735

Arg Pro Arg Glu Ser Tyr Gln Met Gly His Met Arg Arg
740 745